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% Molaoular cloning of tha gone for human antMiaemophlllo faetor IX. 

© rt has been a problem to find an alternative, lass tlmecon- 
aumlng, and mora reliable source of factor IX, a polypeptide 
which is es«*"llai to tha human blood-clotting process and ne- 
cessary for tne treatment of patients with Christmas disease. 
Tha Invention Is an Important step towards solving the problem 
by way of genetic engineering, in that it provides recombinant 
DNA containing a 0NA sequence occurring In the human fac- 
tor IX genome. It Includes recombinant ONA comprising sub- 
stantially the whole sequence of human factor IX genome In- 
serted In a cloning vehicle and transformed Into a host such as 
E.coli. It Is conveniently characterised by a 129 or 203- nucleo- 
tide long sequence (J-J- and J'-J" In Figure 9). Other frag- 
ments of the sequence have also been cloned and tha Inven- 
tion Includes ONA molecules comprising part or all of tha hu- 
man factor IX ONA, Tho invention also Includes cDNA derived 
from human factor IX RNA. Uses of the Invention Include the 
provision of an Intermediate of value In tha genetic engineering 
of a factor IX polypeptide precursor and thence manufacture of 
tha factor IX polypeptide, and in making probes for use In diag- 
nosing the presence of normal or abnormal factor IX DNA In 
patients with Christmas disease. 
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GENETIC ENGINEERING ETC 



123578 



Background of the Invention 
!• Field of the Invntion 

This invention is in the field of genetic engineering, relating 
to factor IX DNA. 
2. Description of prior art 

Factor IX (Christmas factor or antihemophilic factor ,B) is 
the symogen of a serine protease which is required for blood 
coagulation via the intrinsic pathway of clotting (Jackson & 
Nemerson, Ann. Rev. Blochem. 49, 763-811, 1980). This factor is 
syntheeisad in the liver and requires vitamin K for its bio- 
synthesis (Di Scipio & Davie, Blochem. U*, 899-904, 1979). 

• Human factor IX has been purified and characterised, but 
details of the amino acid sequence are fragmentary. It is a 
single-chain glycoprotein, with a molecular weight of approxi- 
mately 60,000 (Suomela, Eur. J.Biochem. 71, 145-134, 1976). Like 
other vitamin K-dependent plasma proteins, human factor IX contains 
in the amino- terminal region approximately 12 gamma-carboxyglutamlc 
acid residues (Di Scipio & Davie, Blochem. J£, 899-904, 1979). 

During the clotting process, and in the presence of Ca** 
ions, factor IX is acted upon by activated factor XI (XIa) by the 
cleavage of two internal peptide bonds, releasing an activation 
glycopeptide of 10,000 daltons (Di Scipio et al., J.Clin. 
Invest. 61., 1528-1538, 1978). The activated factor IX (IXa) is 
composed of two chains held together by at least onedisulphide 
bond. Factor IXa then participates in the next step in the 
coagulation cascade by acting on factor X in the presence of 
activated factor VIII, Ca* + ions, and phospholipids (Undquist 
atal., J.Biol.Chem. 253. 1902-1909, 1978). 

Individuals deficient in factor IX (Christmas disease or 
haemophilia B) show bleeding symptoms which persist throughout 
life. Bleeding may occur spontaneously or following Injury. This 
may take place virtually anywhere. Bleeding into the joints is 
common, and after repeated haemorrhages, may result in permanent 
and crippling deformities, the condition is a sex-linked disorder 
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affecting malts. Its frequency in the population la approxi- 
mately 1 In 30,000 malea. 

The current method of diagnosing Christmas disease Involves 
measurement of the tltre of factor IX In plasma by a combination 
05 of a clotting assay and an Immunochemical assay. Treatment of 
haemorrhage In the disease consists of factor IX replacement by 
means of Intravenous transfusion of human plasma protein concen- 
trates enriched In factor IX. The enrichment of plasma in 
factor IX Is a time-consuming process. 
10 Summary of the invention 

After considerable research and experiment, Important 
progress has now been made towards producing artificial humpn 
factor IX by recombinant DNA technology (genetic engineering). 
Thus, the cloning of DNA sequences which are substantially the 
15 aame as extensive sequences occurring In the human factor IX 
genome has been achieved. 

The invention arises from the finding that an extensive DNA 
sequence of the human factor IX genome can be obtained by a clever 
and laborious combination of chemical synthesis and artificial 
20 biosynthesis, starting from elementary nucleotide or dlnucleotlde 
"building blocks", as will be described below. 

A major feature of the invention comprises recombinant DNA 
which comprises a cloning vehicle DNA sequence and a sequence 
foreign thereto (I.e. foreign , to the vehicle) which is substan- 
25 tlally the same as a sequence occurring in the human factor IX 
genome. A 11873 nucleotide long part of such a foreign sequence 
haa been Identified and a very large part of it haa been sequenced 
by the Haxam-Gllbert sequencing method. A 129 nucleotide length 
of this sequence le more than sufficient to characterise it 
30 unambiguously as coding for a specific protein and a particular 
auch length is regarded herein as useful to characterlae the whole 
sequence Inserted in the cloning vehicle as one occurring in the 
human factor IX genome. Other cloned sequences can then be verified 
as belonging to the human .factor IX genome by determining that part 
35 thereof le Identical to a region of the first-mentioned sequence, 
i.e. the sequences have a common identity in an overlapping region. 
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A further feature of the invention therefore comprises recombinant 
DNA which comprises a cloning vehicle or vector DNA sequence and 
a DNA sequence foreign thereto which consists of or includes 
substantially the following sequence of 129 nucleotides (which 
should be read in rows of 30 scross the page):- 



10 



ATGTAACATG 
CCGACCACTT 
ACAACGTGCT 
ATCGACTTGC 
AACCAGCAG 



TAACATTAAG 
TTGTAAAAAT 

TTGCTCCTCT 
AGAAAACCAG 



AATGGCAGAT 
AGTGCTGATA ' 
ACTGAGGGAT 
AAGTCCTGTG 



(I) 



5 



The invention Includes particularly recombinant DNA which 
comprlsss a cloning vehicle DNA ssquence end a aequence foreign to 
the cloning vehicle, wherein the foreign sequence Includes substan- 
tially the whole of an exon sequence of the human factor IX 
genome. The 129-nucleotide sequence described above corresponds 
substantially to such an exon ssquence. Another such exon 
sequence which Independently characterises the human factor IX DNA 
ia the 203-nucleotide sequence substantially as follows (again 
reading in rows of 30 across the page):- 



TGCCATTTCC ATGTGGAAGA GTTTCTGTTT 

CACAAACTTC TAAGCTCACC CGTGCTCAGG 

CTGTTTTTCC TGATCtGGAC TATGTAAATT 

CTACTGAAGC TGAAACCATT TTCGATAACA 

TCACTCAAAG CACCCAATCA TTTAATGACT 

TCACTCGGCT TGTTGGTGGA GAAGATGCCA 

AACCAGGTCA . ATTCCCTTGG GAG 

The intron sequences of the human factor IX genome are excised 
during the transcription process by which mRNA is made in human 
cells. Only exon sequences are translated into protein. DNA coding 
for factor IX has been prepared from human mRNA. This cDNA has been 
partly sequenced and found to contain the same 129- and 203-nucleo- 
tlde sequences set out above. 



-0107278 

- 4 - 

The Invention also includes recombinant DNA which comprises a 
cloning vehicle sequence and a DNA eequence foreign to the cloning 
vehicle, wherein the foreign sequence comprise* a DNA sequence 
which is complementary to human factor IX mRNA. Such a recombinant 

OS cDNA can be Isolated from a library of recombinant cDNA clones 
derived from human liver mRNA by using an axon of the genomic 
human factor IX DNA (or part thereof) as a probe to acreen this 
library and thence isolating the resulting clones, 

<- The invention also Includes recombinant DNA in which the 

10 foreign sequence is any fragment of human factor IX DNA, particu- 
larly of length at least 50 and preferably at least 75 nucleotides 
or base-pairs* It Includes such recombinant DNA whether or not 
part of the 129 or 203-base-palr sequence defined above. It 
includes especially part, or all of the axon sequences of human 

15 factor IX genomic DNA. Various short lengths up to about 11 

kilobases (11,000 nucleotides or base-pairs) long have been prepared 
by uae of various restriction endonuc leases. Methods of isolating 
recombinant DNA from clones are well known and some i:re described 
hereinafter. The DNA of the invention can be single or double 

20 stranded form. 

The recombinant human factor IX DNA of this Invention is 
useful as a tool of recombinant DNA technology. Thus it is useful 
ae the first stage in the production of artificial human factor IX 
and in the preparation of probes for diagnostic purposes. 

25 In the production of artificial human factor IX it is 

contemplated that appropriate cDNA or genomic clones will be 
introduced into a suitable expression vector in either mammalian 
or bacterial -systems. For mammalian studies, the gene might be too 
X/n* to be conveniently retained in one clone. Therefore a suitable 

30 artificial "mlnlgene" will be designed and constructed from suitable 
parts of the cDNA and genomic clones. The mlnlgene will be under 
the control of its own promoter or instead will be replaced by an 
artificial one, perhapa the mouse metallothloneine I promoter. The 
reaultant 'mlnlgene 1 will then be Introduced into mammalian tissue 

35 culture cells e.g. a hepatoma cell line, and select loh for clones 
of cells syntheslslng maximum amounts of biologically active 
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factor IX will be carried out. Alternatively "genetic farming" 
could be employed ae haa been demon* t rated for mouse growth hormone 
(Palmlter et al, Nature 300, 611-613, 1982). The mlnlgene would 
be micro- injected into the pronucleue of fertilised eggs, followed 
03 by in vivo cloning and selection for progeny producing the largeet 
quantity of human factor IX in blood. Alternatively, it la contem- 
plated that the cDNA clone or aalected parte of it will be linked 
to a suitable atrong bacterial promotor, e.g. a Lac or Tr£ promoter 
or the lamdbe P R or P L , and a factor IX polypeptide obtelned 
10 therefrom. 

The natural factor IX polypeptide ie aynthealaed aa a 
precursor containing both a signal and propeptide region. They 
are both normally cleaved off in the production of the definitive 
length protein. Even this product la merely a precursor. It ie 
13. biologically inactive and m»st be gamma-carboxylated at 12 epeclflc 
N-terminal glutamic acid residues in the ao called 'GU 1 domain by 
the action of a specific vitamin K-dependent carboxylase. In 
addition, two carbohydrate molecules are added to the connecting 
peptide region of the molecule, but it remains unknown whether 
20 they are required for activity. The eubatrate for the carboxylaae 
is unknown and could be the precursor factor IX polypeptide or 
alternatively the definitive length protein. Therefore various 
relevant polypeptides both with and without the precursor domains 
will be "constructed" using genetic engineering methods in bacterial 
23 hosts. They will then be tested as substrates for the conversion 
of inactive to biologically active factor IV in vitro by the action 
of partially purified preparations of the carboxylase enzyme which 
can be isolated from liver microsomes or other suitable sources. 

Por diagnostic purposes, the recombinant human genomic 
factor IX DNA or recombinant human mRNA-derived factor IX DNA has 
a wide variety of usee. It can be cleaved by ensymes or combina- 
tions of two or more ensymes into shorter fragments of DNA which 
can be recomblned into the cloning vehicle, producing "sub-clones". 
Theee sub-clones can themselves be cleaved by restriction ensymes 
to DNA molecules suitable for preparing probes. A probe DNA (by 
definition) is labelled in some way, conveniently radiolabeled, 
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and can be used to examine in detail mutations In the human DNA 
vhlch ordinarily would produce factor IX. Several different 
probes have been produced for examining several different regions 
of the genome where mutation was suspected to have occurred In 
OS patients* Failure to obtain hybridisation from such a probe 

! j 

Indicates that the sequence of the probe differs in the patient's 
DNA. In particular It has been shown that Christmas disease can 
be detected or confirmed by such methodology. Useful probes can 

contain lntron and/or exon regions of the, genomic DNA or can 

1 i 

10 contain cDNA derived from the mRNA. 

The Invention Includes' particularly probe DNA, i.e. which is 
^labelled, and of a length suitable fox. the probing use envisaged. 
Dt can be single-stranded or double-stranded over at least the 
human factor IX DNA probing sequences thereof and such sequences 
15 will usually have a length of at least 15 nucleotides, preferably 
at least 19*30 nucleotides In order to have a reasonable probability 
of being unique They will not usually be larger than 5 kb and 
rarely longer than 10 kb. 

The Invention accordingly Includes a DNA molecule, comprising 
20 P*?t of the human factor IX DNA sequence, whether or not labelled, 
v whether lntron or exon or partly both. It also Includes human cDNA 
corresponding to part of all of human factor IX mRNA. It Includes 
particularly a solution of any DNA of the Invention, which la a 
form In which It Is conveniently obtainable by electroelutlon from 

The invention Includes, of course, a host transformed with 
any of the recombinant DNA of the invention. The host can be a 
bacterium, for example an appropriate strain of E. coll . chosen 
according to the nature of the cloning vehicle employed. Useful 

30 hosts may Include strains of Pseudomonaa . Bacillus subtllis and 
Bacillus stearothermophilus , other Bacilli , yeasts and other fungi 
and mammalian (including human) cells. 

One process practised in connection with this invention for 
preparing a host transformed with the recombinant DNA of the 

35 invention is based on the following steps:- 
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(1) syntheslslng an oligodeoxynucleotlde having a nucleotide 
sequence comprleing that occurring in bovine factor XX oeaaenger 
RNA coding for amino aclda 70-75 or 348*352 of bovine factor IX, 
and labelling the ollgodeoxynucleotlde to form a probe; 

(2) preparing complementary DNA to a mixture of bovine 
mRNAa; , 

(3) inserting the complementary DNA In a cloning vector to 
form a mixture of recombinant bovine cDNAe; 

(4) transforming a hoet with eald mixture of recombinant * 
bovine cDNAs to form a library of clonee and multiplying aald 
clones; 

(5) probing the clones with the synthetic ollgodeoxy- 
nucleotlde probe obtained in step 1 and isolating the resultant 
recombinant bovine factor IX cDNA-contalnlng clone; 

*5 (6) digesting the recombinant bovine factor IX cDNA from 

said clone with one or more enzymes to produce a bovine factor IX 
cDNA molecule comprising a shorter sequence of bovine factor IX 
DNA, but preferably at least 50 base-pairs long; and 

(7) probing a library of recombinant human genomic DNA in a 

20 transformed host with the shorter sequence bovine factor IX cDNA 
molecule, to hybridise the human genomic DNA to the said 
recombinant bovine factor IX DNA and isolating the resultant 
recombinant DNA-transformed host. 
Brief description of the drawings 

25 Figure 1 shows the structure of a published amlno-acid 

sequence of bovine factor IX polypeptide, the deduced sequence of 
the mRNA from which it would be translated and the structures of 
oligonucleotides (ollgo-Nl and N2) synthesised in the course of 
this invention; 

30 Figures 2 and 3 show the chemical formulae of "building 

blocks" used to syntheslse the oligonucleotides referred to in 
Figures 1 and 11; 

Figure 4 is an elevatlonal view-, partly sectioned, showing an 
. apparatus for syntheslsing oligonucleotides; 
33 Figure 5 shows the sequence of part of the bovine factor IX 

cDNA obtained in this invention; 
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Figure 6 is a map showing the organisation of an approxi- 
mately 27 kb length of human factor XX genomic DNA and is divided 
into five portions, shoving t- 

(a) the exon regions; 



from three clones in lambda phage vector. 

Figure 7 shows the sequence of the DNA of Figure 6(b) and in 
parts the encoded protein; 

Figure 8 shows a restriction enzyme chart of the sequence 
shown in Figure 7; 

Figure 9 shows part of the sequence of th'i human factor IX 
cDNA and its encoded protein; 

Figure 10 shows the structure of a pair of complementary 
oligonucleotides (ollgo N3 and N4) synthesised in the course of 
this invention; — 

Figure 11 shows part of the DNA sequence of the vector 
pAT153/PvuII/8 of this invention, in the region where It differs 
from pATISS; 

Figure 12 is a diagram of plasm id pHXX17 of the invention 
ehowlng the origin of the 1.4 kb fragment used for probing and 
initial sequencing; and 

Figure 13 shows the position of the major radioactive bands 
on probing a "Southern blot' 1 of normal human DNA, cut by the 
restriction enzymes EcoR I(E). Hindi 11(H), Bgll l(B) and Bell (Be), 
with a sub-clone of the recombinant human factor IX DNA of this 
invention. 

Description of preferred embodiments 
1. General description 

A recombinant DNA of the Invention can be extracted by means of 
probes from a library of cloned human genomic DNA. This is a 



(b) 
(c) 



the 11,873- nucleotide length eequenced; 

cDNA molecules obtained by restriction with various 

endonucleases, sub-cloned and subsequently used as 

probes; 

DNA molecules obtained by restriction with various 
endonucleases; and 

three regions of human factor IX genomic DNA derived 



(d) 



(e) 
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known recombinant library and tha invention dees not, of c outs a, 
axtand to human genomic factor IX DNA when preaent in auch a 
library. The probea used were of bovine factor IX cDNA (DNA 
complementary to bovine mRNA) , which were prepared by an elaborate 
process involving firstly the preparation of recombinant bovine . 
cDNA from a bovine mRNA atartlng material, secondly the chemical 
syntheses of oligonucleotides, thirdly their use to probe the 
recombinant bovine cDNA, in order to extract bovine factor IX cDNA 
and fourthly the preparation* of suitable probes of ehorter length 
from the recombinant bovine factor IX cDNA. The f irat probe tried 
appeared to contain an irrelevant sequence and the second probe 
tried, not containing it, proved successful in enabling a single 
clone of the human genomic factor IX DNA to be isolated. This 
clone is designated lambda HIX-1. The steps Involved are deacribed 
in more detail in the sub-section "Examples" appearing hereinafter, 
and the aecond probe comprises the 247 base-pair DNA sequence of 
bovine factor IX cDNA indicated in Figure 5 of the drawings. The 
invention therefore provides apecifically a recombinant DNA which 
comprises a cloning vehicle sequence and a DNA sequence foreign to 
the cloning vehicle, which recombinant DNA hybridises to a 247 
base-pair sequence of bovine factor IX cDNA indicated in Figure 5 
(by the arrows at each end thereof). 

* 

The cloning vehicle or vector employed in the invention can 
be any of those known in the genetic engineering art (but will be 
chosen to be compatible with the host) . They include E. coli . 
plasmlds, e.g. pBR322, pAT153 and modifications thereof, plasmlds 
with wider host ranges, e.g. RP4 plasmlds specific to other 
bacterial hosts, phages, especially lambda phage, and cosmlds. A 
cosmid cloning vehicle contains a fragment of phage DNA including 
its cos (cohesive-end site) inserted in a plasmld. The resultant 
recombinant DNA la circular and has the capacity to accommodate 
very large fragments of additional foreign DNA. 

Fragments of human factor IX genomic DNA can be prepared by 
digesting the cloned DNA with various restriction enzymes. If 
desired, the fragments can be rellgated to a cloning vehicle to 
prepare further recombinant DNA and thereby obtain "sub-clones". 
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In connection vlth this embodiment a new cloning vehicle has been 
prepared. This is a modified pAT133 plasmld prepared by ligating 
a BaaH I and Hindi II double digest of pAT153 to s pair of compli- 
mentary double sticky-ended oligonucleotides having a DNA sequence 
05 providing a BamH I restriction residue at one end, a Hindi II restric- 
tion residue at the other end and a PvuII restriction site In 
between. 

While the invention is described herein with reference to 
human genomic factor IX DNA in particular, the Invention Includes 

10 human factor IX cDNA (complementary to human factor IX mRNA) which 
contains substantially the same sequences. A library of human cDNA 
haa been prepared and probed with human factor IX genomic DNA to 
isolate human factor IX cDNA from the library. For thla purpose 
the probe DNA is conveniently of relatively short length and must 

15 Include at least one exon sequence. The invention therefore 

Includes a process of prepsring a host transformed with recombi- 
nant DNA, comprising cloning vector sequences and a sequence of 
nucleotides comprised in cDNA complementary to human factor IX 
mRNA, which process comprises probing a library of clones contain- 

20 ing recombinant DNA complementary to human mRNA with a probe 

comprising a labelled DNA comprising a sequence complementary to 
part or all of an exon region of the human factor IX genome. 
2. Examples ^ 

A. Bacteria used 

25 E. coll K-12 strain MC 1061 (Casadaban & Cohen, J.Mol. 

Biol. 138 , 179-207, 1980), E. coll K-12 strain HB 101 (Boyer & 
Roulland-Duasoix, J.Mol. Biol, 41, 459-472, 1969) and E. coll K-12 
strain K803 which is a known strain used by genetic engineers. 

B. Source and purification of bovine factor IX, anti-bovine 
30 factor IX antibody, and bovine mRNA 

Highly purified bovine factor IX and rabbit anti-bovine 

factor IX antiserum were gifts from Dr. M.P. Esnouf • Analysis- 

of the purified bovine factor IX on a denaturating polyacrylamide 
gel ahowed that it has a purity of greater than 99X. Specific 
35 anti-factor IX immunoglobulins used for immunopreclpltation 
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experiments were purified as described by Choo et al., Bio- 
chea.J. ±99, 527-535," 1981, by passage of the crude antiserum 
through a Sepharose-4B column onto vhlch pure bovine factor XX has 
been coupled* 

05 Bovine mRNA vas obtained from calf liver and laolated by the 

guanidine hydrochloride method (Chirgwin et al., Biochem. 18, 
5294-5299, 1979). The mRNA preparation was passaged through an 
oligo dT-cellulose column (Caton and Robertson, Nucl.Aclds 
Res. 2» 1445-1456, 1979) to isolate poly(A) ♦ mRNA. 

10 Poly (A) ♦ mRNA vas translated in a rabbit reticulocyte cell-free 
system in the presence of 35 S-cystelne as described by Pelham and 
Jackson (Eur. J. Biochem. 67, 247-256, 1976). At .the end of the 
translation reaction, factor IX polypeptide vas precipitated. by 
the addition of specific anti-factor IX immunoglobulins. The 

15 immunopreclpltatlon procedure vas as described* oy Choo et al . , 
Biochem. J. UU, 285-294, 1979. the immunoprecjLpitated material 
vas v&sh-d throughly and resolved on a tvo-dimensional SDS-poly- 
acrylaraide gel (Choo et al., Biochem. J. 181 , 285-294, 1979), by 
isoelectric focussing in one dlm^n^ipn and electrophoresis in 

20 another. Some polypeptides of known molecular veight vere 

subjected to this procedure, to serve as reference points. The 
immunoprecipitated material shoved 4 pronounced spots, all in 
the 50,000 molecular veight region and vlth separated isoelectric 
points. These predominant spots of molecular veight about 50,000 

25 represent a single polypeptide chain plus a possible propeptide 
signal sequence, a deduction compatible vith published data 
(Katayama e£ aI7,Proc. Natl. Acad. Scl.USA J±* 4990-4994, 1979). 

When the gel analysis vas repeated for the same material but 
immunoprecipitated in the presence of unlabelled pure bovine 

30 factor IX, the 4 spots appeared at reduced intensity, indicating 
that the translation product is specifically competed for by pure 
factor IX. Thirdly, Immunopreclpltatlon vas performed using a 
control rabbit antiserum, i.e. from a rabbit vhlch had not been 
immunised vlth factor IX. None of the 4 spots appeared. These 

35 results therefore indicate that the translation product vas a 
factor IX polypeptide. 



i 



:^ 0107278;. 

.. - '12 - ' 

The specific iramunological/cell-f ree translation assay 
established above was used to monitor the enrichment of factor IX 
mRNA on sucrose gradient cent rifugat ions. Total poly(A)+mRNA was 
resolved by two euccessive separations by sucrose gradient centrl- 

03 fugations. When individual fractions from the gradient were 
assayed by the above method, a fraction of site 20-22 Svedberg 
units (approx* 2,5 kllobases of RNA) region was. found to be 
enriched (approx. ten-fold) for the bovine factor IX mRNA. This 
enriched fraction was used In the subsequent cloning experiments. 

10 C. Synthesis of specific bovine factor IX deoxyollgonucleotide 
mixtures 

Starting from a knowledge of the amino acid sequence of 
bovine factor IX (Katayama ej^al., Proc.Nfetl.Acad.Sci. 
USA2i» 4990-4994, 1979), the synthesis of two mixtures of 

15 oligonucleotide probes was designed. These probes consisted of 
DNA sequences coding for two different regions of the protein. 
The regions selected were those known to differ in sequence In the 
analogous serine proteases, prothrombin, Factor C and Factors VII 
and X and were those corresponding to amino acids 70-75 and 348-352 

20 respectively. The 70-75 region was particularly favourable In 
that the mixture of oligonucleotides syntheslsed, I.e. ollgo N2A 
and ollgo N2B, contained all 16 possible sequences that might 
occur in a 17 nucleotide long region of the mRNA corresponding to 
amino acids 70-75. The ollgo N2A-N2B mixture le hereinafter 

25 called M ollgo N2" for brevity. 

Figure 1 of the drawings shows the two selected regions of 
the known amino acid sequence of bovine factor IX, the corres- 
ponding mRNA and the oligonucleotides syntheslsed. Since some of 
the amino acids are coded for by more than one nucleotide triplet, 

30 there are 4 ambiguities in the mRNA sequence shown for amino 
acids 70-75 and therefore 16 possible individual sequences. 

The nucleotide mixtures ollgo Nl and ollgo N2 were synthe- 
slsed using the solid phase phosphotrlester method of Duckworth 
*t al., Nucl.Aclds Res. £, 1691-1706, 1961, modified in two ways. 

35 Firstly, £-chlorophenyl rather than £-chlorophenyl blocking groups 



-0107278 

- 13 - 

vert used for the phosphotrlester grouping, and were Incorporated 
in the mononucleotide and dinucleotide "building blocks 19 . Figures 2 
and 3 of the drawings show (a) dinucleotide and (b) mononucleotide 
"building blocks". DMT - 4,4 1 - dlmethoxytrityl and B • 6-N-benzoyl- 
adenin-9-yl, 4-N-benzoylcytosin-l-yl, 2-N-lsobutyrylguanin-9-yl or 
thymln-l-yl, depending on the nucleotide selected. Secondly, the 
"reaction cell" used for the successive sdditlon of mono* or 
dinucleotide "building blocks" was miniaturised eo that the coup- 
ling etep with the condensing agent i-(mesii/lene-2-sulphonyl)-3- 
nltro-l,2,4-triasole (MSNT) was carried out in a volume of 0,5ml 
pyridine containing 3.5 mlcromoles of polydlmethylacrylamlde 
resin, 17.5 mlcromoles of incoming dinucleotide (or 35 mlcromoles 
of mononucleotide) and 210 mlcromoles of MSNT. 

Figure 4 of the drawings is an elevatlonal view of the micro- 
reaction cell 1 and stopper 2 used for oligonucleotide synthesis, 
drawn 70% of actual size. The device comprises a glass-to-PTFE 
tubing joint 3 at the inlet end of the stopper 2. The stopper has 
an Internal conduit 4 which at its lower end passes into a hollow 
tapered ground glass male member 5 and thence into a sintered 
glass outlet 6 to the stopper. The cell 1 has a ground glass 
female member 7 complementary to the member 5 of the stopper, 
leading to. reaction chamber 8, the lower end of which terminates 
in a sintered glass outlet 9. This communicates with glass tubing 10, 
and a 1.2mm. "Interflow" tap 11. Further glass tubing 10, beyond 
the tap 11, leads to the outlet glass-to-PTFE tubing Joint 12. 
Pairs of ears 13 on the stopper and cell enable them to be joined 
together by springs (not shown) in a liquid-tight manner. 

After completion of the synthesis and deprotectlon, fractiona- 
tion was carried out by High pressure liquid chromatography (Duckworth 
et al. ,.see above) and the peak tubes corresponding to the product 
of correct chain length were located by labelling of fractions at 
their 5*-hydroxyl ends using [gamma- P]-ATP and T4 polynucleotide 
kinase, followed by 20% 7M urea polyacrylamlde gel electrophoresis. 
The position on the gel of the 17- and 14- oligonucleotides was 
determined by separately labelling, by the method described 
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above, 17- and 14- nucleotide long "marker" oligonucleotides and 

•objecting that* to the aame gal alectrophoresla. 

D. Preparation of libraries of cDNA sequences for bovlna mRNA 

Two different approaches were used for the generation of 
cloned cDNA library :- 

(1) Mbol library Flrat atrand cDNA vaa aynthealaed ualng the 
sucrose gradient-enriched poly(A) ♦ bovine mRNA as template. The 
conditions used vere as described by Huddleston & Brovnlee, Nucl. 
Acids Ree^O, 1029-1030, 1981, except that 2 micrograms of oligo 
N-l, 20-30 micrograms of the mRNA, 10 microcurles talpha- 32 P] 
-dATP (Amersham, 3000 Ci/mmole), and 50 U of reverse transcriptase 
vere used in a 50 rnicrolitre reaction. "dNTP" in Figure 1 denotes 
the mixture of 4 deoxynucleoalde triphosphatea required for 
synthesis. Oligo N-l hybridises to the corresponding region on 
the mRNA (refer to Figure 1) and thereby acta as a primer for the 
initiation of transcription. It was used in order to achieve a 
further enrichment for factor IX mRNA. At the end of the cDNA 
synthesis reaction, the cDNA was extracted vlth phenol and 
desalted on a Sephadex-GlOO column, before it was treated with 
alkali (0.1M NaOH, ImM EDTA) for 30 min. at 60°C to remove the 
mRNA strand. Second strand DNA synthesis was then carried out 
exactly as published (Huddleston & Br own lee, Nucl.Aclda 
Res. 10, 1029-1038, 1981). 

The double-stranded DNA was next cleaved with the restriction 
enzyme Mbol and H gated to the plasmld vector pBR322 which had 
been cut with BamHI and treated with calf intestinal alkaline 
phosphatase to minimise vector aelf-religatlon. Phosphatase 
treatment was carried out by incubating 5 micrograms of BamH I-cut 
pBR 322 plasmid with 0.5 microgram calf intestinal phosphatase 
(Boehrlnger; in lOmM Tris - HC1 buffer, pH 8.0) in a volume of 50 
microlitrea at 37°C for 10 minutes, aee Huddleston & Brownlee 
supra . 

The llgated DNA vaa uaed to transform E. coll strain MC 1061. 
For transformation E. coll MC 1061 vaa grown to early exponential 
phase as indicated ~by an absorbancy of 0.2 at 600 nm and made 
"competent" by treating the pelleted bacterial cells first vlth 
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one half volume, followed by repelletlng, and then vlth 1/50 volume 
of the original growth medium of lOOmM CaClj 15X v/v glycerol and 
lOmM PIPES-NaOH, pH 6.6 at 0°C. Celle were immediately froien in 
a dry lce/ethanol bath to -70°C. For transformation, 200 microlitre 

05 allquots were mixed with 10 microlitre a of the recombinant DNA and 
Incubated at 0°C for 10 minutes followed by 37°C for 5 minutes. 
200 microlitree of L-broth (bactotryptone 10g., yeast extract 5g«, 
sodium chloride 10g., made up to 1 litre with deloniaed water) 
were then added and incubation continued for a further 30 minutee 

10 at 37°C. The solution was then plated on the appropriate antibiotic 
agar (see below). A library of about 7,000 ampicillin-reslstant 
colonies was thus obtained. They were ampicillin-reslstant because 
they contained the beta-lactamase gene of pBR 322. Of these, 
approx. 85% were found to be tetracycllne-sensltlve. 

15 (ii) dC/dC tailed library In the preparation of thie library, 
first s'.rand cDNA was synthesised as described for the above 

library except that oligo dT ( i 2-18) VftS u9ed * s a P rimcr t0 
initiate cDNA synthesis. Following this, the cDNA wae tailed with 
dCTP using terminal transferase and back-copied with the aid of 

20 oligo 4C( 12-I8) P rimer and averse transcriptase to give double 
stranded DMA, exactly according to the method of Land et al. , 
Nucl. Acids Res. 9., 2251-2266, 1981. After a further tailing with 
dCTP, this material was annealed by hybridisation to a dGTP-tailed 
pBR322 plasmld at the PstI site. The hybrid DNA was used to 

25 transform JS.cpli strain MC 1061. A library of approximately 10,000 
tetracycline-reslstant colonies was obtained. Of these, approxi- 
mately 80Z were found to be sensitive to ample ill in, due to 
insertion of DNA into the ampicillin-reslstant gene at the PstI 
site. 

30 E. Isolation of specific bovine factor IX clones 

(i) From Mbol library 

The library of colonies, in an unordered fashion, was 

tranaferred onto 13 Whatman 541 filter papers and amplified with 

chloramphenicol, to Increase the number of copies of the plasmld 
35 in the colonies, as described by Gergen e£ al^. , Nucl. Acids 

Res., I* 2115-2136 (1979). The filters were pre-hybrldlsed 
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at 65°C for 4h in 6 x NET (1 x NET - 0,15m NaCl, ImM EDTA, 15mM 

Trie-HCl, pH 7.5), 5 x Denhardt'e, 0.5X NP40 non-ionic surfactant, 

and 1 microgram/ml. yeast RNA aa described by Wallaca at al., 

Nucl. Acids Res. 9, 879-894 (1981). Hybridisation vas carried out 

at 47°C for 20h in the same solution containing 3 x lo'cptn (0.7 

nanogram/ml) of labelled ollgo N-2 probe. Labelling vas done by 

phosphorylation of the oligonucleotides at the 5* hydroxyl and 
32 

using [gamma- P) -ATP and T4 phosphokinasa (Huddlaston & Brovnlee 
Nucl. Acids Res. 10 t 1029-1036, 1981). At the and of the hybridi- 
sation, filtera were washed successively at 0-4°C (2h), 25°C 
(10 min), 37°C (10 min) and 47°C (10 rain). After radioautography 
of the filters from this screening, one colony shoved a positive 
signal above background. This colony vas designated BIX-1 clone, 
(ii) From dC/dC-tailed library 

Screening of this library, in an ordered array fashion, using 
ollgo N-2 probe at, described above has resulted in the identifica- 
tion of a positive clone. This vas designated BIX- 2 clone. 
F. Sequence characterisation of bovine factor IX cDNA clones 

Characterisation of BIX-1 clone by restriction endonuclease 
cleavage indicated that it contained a DNA lnaert of about 430 
baaa-palrs (data omitted, for brevity). Figure 3 shows part of 
the nucleotide sequence of the coding strand, determined by the 
Maxam-Gllbert method, extending over 304 nucleotides and provides 
direct evidence that it has the identity of a bovine factor IX 
sequence. Thus,' nearly all of thla 304 nucleotide sequence 

(corresponding to amino acid residues 52-139) agrees with the 

.i 

nucleotide sequence predicted from the known bovine factor IX 
amino acid sequence data (Katayama et al., Proc.Natl.Acad.Scl. 76 , 
4990-4994, 1979). Over thla region, there are no discrepancies 
between BIX-1 and these published data for factor IX, except at 
nucleotides 38-40 where the amino acid coded for la Asp Instead of 
Thr. This amino acid change vas similarly observed In a second, 
independent cDNA clone (BIX-2; see below). The remainder of 
the 304-nucleotlde sequence, i.e. that shown In brackets in 
Figure 5, does not sgree with the published bovine factor IX 
amino acid data of Katayama. 
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In Figure 5, the underlined portion denotes the sequence 
corresponding to the oligo N-2 probe sequence, the eeteriek 
denotes e nonsense cod on, the brackets enclose a eequence Which 
doee not correspond to Katayama'a amino acid data and the arrows 
indicate Hinf l restriction sites. The Katayama numbering system 
for amino acids is shown and this sequence is in the opposite 
orientetlon to the direction of transcription of the tetra- 
cycline-resiatant gene of the plasmld. 

By similar methods* BIX-2 clone was found to have a DNA 
ineert of 102 nucleotides end thle spans the nucleotide posi- 
tions 7-108 as shown in Figure 5. The nucleotide sequences for 
BIX-1 and BIX-2 clonee over this region (nucleotide 7-108) were * 
identical. 

C Isolation of human factor IX gone 
U) Initial clone - lambda HIX-1 

A library of cloned human genomic DNA, namely a Haelll/Alul 
lambda phage Charon 4A library prepared by Lam et 
Cell, ^15, 1157-1174, 1978, was used. 10 6 phage recombinants from 
this library were ecreened using the in eitu pleque hybridieation 
procedure as described by T. Maniatis et al., Cell, ^5, 687, 1978. 
Pre-hybridisation and hybridisation were carried out at 42°C 
in 50% formamlde. After hybridisation, filters were washed at 
room temperature with 2 x SSC (1 x SSC - 0.15mM NaCl, 15mM eodlum 
citrate, at pH 7.2) and 0.1% SOS, then at 65°C with 1 x SSC 
and 0.1% SDS. 

Two DNA molecules, being restriction fragments from the 
factor IX cDNA cloned in BIX-1, were radiolabeled and used as 
probes in the hybridisation. The first fragment corresponds to 
nucleotide numbers -8 to 317 on the numbering system of Figure 5, 
and wae Isolated by Sau3AI digestion of BIX-1 plasmld DNA. The 
isolated DNA was labelled to high specific activity by incorpora- 
tion of [alpha - 32 P] -dATP using a nick translation (Rigby at al., 
J. Mol.Biol. H3. 237-251, 1977, modified, vide infra) . Using 
this probe, 10 clonee were isolated. These were plaque-purified * 
and re-hybrldlaed with a 247-nucleotide fragment from BIX-1 clone. 
This fragment, derived from nucleotides 3-249 can be seen from 
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Figure 5. It contains only sequences In agreement with the 
Katayama bovine factor IX amino acid sequence and was isolated by 
Hinfl digestion of BIX-l plasmld DNA. Only a single clone gave a 

positive hybridisation signal vith this 247-nucleotlde probe. 

v 

This clone was further plaque-purified and the resulting clone was 

designated "lambda HIX-1". 

*(ii) Subsequent genomic clones 

A sub-clone, pATIXcVII, of recombinant human factor IX cDNA 

from human liver mRNA, and prepared as described in Section L 

below, was linearised ly digestion with Hind II I and BamHI. The 

resulting 2 kb cDNA molecule was purified by IX agarose gel electro 

phoresis. After electroelutlon, about 100 ng of this cDNA was 

32 

nick-translated with [alpha p] dATP (see .above) and used as a 
hybridisation probe to screen the Hael ll /Alu l lambda phage 
Charon 4A human genomic DNA library for further genomic clones, 
using standard stringent hybridisation conditions. Two further 
human factor IX genomic clones, designated lambda HIX-2 and lambda 
HIX-3,~were thus obtained. 

H. Characterisation of human factor IX genomic clones 
(i) Restriction map 

The Initial lambda HIX-1 clone was characterised by cleavage 
with various single and double digests with different restriction 
endonucleases and Southern blotting of fragments using the bovine 
factor IX cDNA probe (results omitted for brevity). The subse- 
quently Isolated lambda HIX-2 and 3 clones were characterised in 
the same way except that the human cDNA probe, pATIXcVII (aee 
Section L below) was used for the Southern blots. Prom these 
results it emerged that the sequences in the factor IX genome 
corresponding to lambda HIX-2 and 3 overlapped with lambda HIX-1 
as shown in Figure 6(e). In Section (d) of this Figure 6 are 
summarised the results of the analysis using the restriction 
ansymes EcoRI (E), Hlndlll (H), Bglll (B), BamH I (Ba) and 
PvuII (P), and this serves as a restriction encyme map. 
.(ii) Sequencing * 

Numerous sub-clones were Isolated from a knowledge of the 
rectrlction enzyme map as described in Section J(il) below, the 
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majority in a vector pAT133/PvuII/8. Examples of those sub-clones 
are shown Id Plgure 6(c) and a number were used and were of a 
convenient length for sequence analysis by the Maxam-Gllbert 
method (Maxam 6 Gilbert, Proc. Natl, Acad .Sci. USA 74 t 36-364, 1980). 

Initially stquenclng waa done on part of a 1.4 kb EcoRI 
restriction fragment from the sub-clone pHIX-17, see below 
and J(l). A 403-nucleotide (base-pair) length was sequenced, of 
which a 129-nucleotlde length was Identified as lying within an 
axon region. This Is the 129-nucleotide sequence used above to 
define the factor IX DNA. 

Subsequently, a region of 11873 bases was sequenced In the 
central portion of the gene [see Figure 6(b)]. Plgure 7 shows 
the sequence of one strand of the DNA. The nucleotides are 
arbitrarily numbered from 1 to 11873 in the 5 f to 3' direction. 
The original 403-nucleotlde sequence runs from Plgure 7 nucleo- 
tides Nob. 4372 to 4774 and is indicated by 0-0'. The 129-nucleo- 
tide sequence lying within the 403 one, runs from Figure 7 nucleo- 
tides Nos. 4442 to 4570 and is indicated by J-J\ This corresponds 
exactly to the "wV exon. 

In detail, the sequence of nucleotides Nos. 1-7830 contains 
two short exons (nucleotides 4442-4570 and 7140-7342 respectively) 
marked w and x in Pigure 6(a), J-J« and J f -J" in Figures 7 and 9. 
These code for amino acids 85-127, and 128-195 respectively of the 
amino acid sequence predicted from the human factor IX cDNA clone 
(Figure 9). There are no differences in amino acid sequences 
predicted from the genomic and cDNA clones of the invention in 
these two exon regions. The sequence of the gene between 
residues 7831-11873 is less complete, containing several gaps, but 
is still a useful characterisation of the gene as it contains two 
" A 1"I repeat" sequences, nucleotides. 7960-8155 and 9671-9938. 
Alul sequences are found in many genes. The repetition is not 
exact but there is a typical degree of homology between them. 
This further characterisation provides a useful cross-check on the 
accuracy of the restriction enzyme map. This emerges more clearly 
from the restriction enzyme chatt of Figure 8. 
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Figure 8 la a chart produced by a computer analyala of the 
sequence data of the 11873 nucleotide long aequence of Figure 7. 
Column 1 of Plgure 8 glvea the arbitrary nucleotide number allotted 
to the nucleotide of Figure 7. Column 2 apportions the nucleotide 
number ae a fraction of the whole sequence. Column 3 above the 
restriction ensymee which will cut the DNA within varloua ehort 
sequences of nucleotides shown In Column 4. The ehort sequences 
of Column 4 begin with the nucleotide numbered In Column 1. With 
the aid of this chart the positions of the restriction sites shown 
In Figure 6(d) and some of the sequences shown in Figure 6(c) can 
be determined very accurately. For example sequences 11- IV are 
produced by restriction at the following sites (denoted by the 
first nucleotide number at the 5 9 end of each alts). 

II 
III 
IV 

Particularly Important eltes are srrowed In Figure 8. Some of the 
relevant nucleotide numbers are shown In Figure 6(c) » the number 
given being that of the nucleotide at the 5' end of each site. 

Further sequence analysis of the sub-clones V, VI, VII 
and VIII shown In Figure 6(c) indicates that the factor IX gene is 
divided Into at least 7 axon regions separated by at leaat 6 
in t rone. The poeitlons of the exons are ehown in Figure 6(a) by 
the solid blocks labelled t, u, v 9 w, x t y and i. The "s" axon Is 
much the longest and its 3 '-end coincides with the 3' -end of the 
MRNA. The location of these exons relative to the cDNA eequence 
Is discussed below (section L) and it is clear that the "t" axon 
ehown in Figure 6(a) is not a marker for the 5'-end of the gene, 
as its sequence fails to match that of the extreme 5 '-end of the 
cDNA clone (see below). This suggests that the factor IX gene 
will be longer at its 5 '-end than the 27 kb region shown In 
Figure 6, and will contain at least one further exon. 

Additionally, pHIX-17 DNA was digested with EcoRI. The 
digested meter lal was reeolved on 0.8X agarose gel and a 1.4 kb 



3624 - 4769 
6380 - 7378 
10389 - 11868 
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fragaant wae iaolated In solution by electroelutlon. It can bo 
etored in the usual manner. This 1.4 kb long molecule was uaad 
for ths Initial saquanclng. Only about L.O kb ia Insartad DNA, 
tha remaining 0.4 kb balng of pBR322. A 403 nuelaotlda langth of 
05 tha insartad DNA was sequenced and is identified aa 0~0' in 

Figure 7. The same 1.4 kb fragment was also labelled and used as 
a probe in Section M. 

I* Construction of a vector pAT133/PvuII/8 

A derivative of the plasmld pAT153 (Twig & Sherratt, 

10 Nature 283, 216-218, 1980) vaa prepared for aubclonlng of PvuII 
fragments of factor IX genomic clones, and for ease of charac- 
terisation of the reaultant subclone a. Two partially complemen- 
tary aynthatlc daoxyollgonucleotldea, oligo N3, and, ollgo N4, 
were synthcaieod by the solid phase phosphotriester method 

15 described in Section C above. Each has "overhanging" BamHI and 
Hindi II recognition sequences and an Internal PvuII recognition 
sequence. Figure 10 shows the structures of oligo N3 and ollgo N4. 
BamHI and Hindi I I cleave ds DNA to leave sticky or "overhanging 99 
ends. For example Hind HI cleaves 

20 - AAGCTT 
- TTCGAA 

between the adenlne-carrylng nucleotides of each atrand leaving 
the aticky-ended complementary strands i- 

- A 

25 - TTCCA 

which are present in the oligo N3/N4 combination. 

pAT153 was digested with Hindi I I and BamHI and the 3393 
nucleotide long linear fragment was aeparated from the 346 nucleo- 
tide ahorter fragment by 0.7% agaroaa gel electrophoresis, followed 

30 by electroelution of the appropriate bands visualised by ethldlum 
bromide fluorescence under UV light. After treatment with calf 
inteatinal phosphatase, as described in Section D(i), the BamHI- 
Hindlll 3393-long fragment was llgated to an equlmolar mixture of 
ollgo N3 and ollgo N4 which themselves had been pre treated, aa a 

33 mixture, with T4 polynucleotide kinase and ATP, to phosphorylate 
their respective 5 9 -terminal OH groups. After transforming 
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competent NC 1061 cells (see above) and plating on L-broth plates 
containing 20 mlcrograms/ml final concentration of amplclllln, 11 
colonies v»re selected for further analysis* 1 ml plasmld prepara- 
tion, see Holmes and Qulgley, Analytical Biochex. 114 , 193-197 (1981), 

03 vaa isolated from the 11 colonies . Ths plasmld DNA was then 
analysed for its ability to be linearised by the restriction 
enzymes BamHI, Hindi II and PvuII. Four clones were positive in 
this assay and one, labelled pAT153/PvuII/8, was selected for 
sequence analysis by the Kaxam-Gllbert method across the newly 

10 constructed section of the plasmld. This part of the sequence Is 
shown in Figure 11 along the unique restriction sites. The novel 
part of the plasmld sequence is underlined i the remainder is 
present in the parent plasmld pAT153. The vector allows blunt-end 
cloning (aftar treatment- with phosphataat)-<into-the Inserted PvuI I 

15 site. The cloned DNA can be excised, assuming that it lacks 
appropriate internal restriction sites, with BamHI /Hind III, 

BamH I /Clal or BamH I /EcoR I double digests. The sites adjacent to 

— — — . 32 

the Pvu II site are also convenient for end labelling with P for 
characterization of the ends of cloned DNA by the Maxam-Gllbert 

20 ••l»* nf! * n 8 method. 

J. Sub-cloning of human factor IX gene 

The following subclonlng experiments wsre carried out as a 
first step towards sequencing of the factor IX gene, and to 
facilitate the isolation of a small DNA fragment to be used as a 

?* probe for the analysis of genomic DNA from haemophilia B patients 
(see eectlons M). 

(i) Sub-cloning into pBR322 plasmld 

An approximately 11 kilobase Ball I fragment (see Figure 6) 
within the factor IX DNA inaert in lambda HIX-1 clone was inserted 
30 into the BamHI aite of pBR322. Transformation was carried out in 
the £• coll strain, HB 101. The resulting "sub-done" was desig- 
nated pHIX-17 (Figure 12). 

(ii) Sub-cloning into pAT153/PvuII/8 

(a) Plasmld DNA from pHIX-17 was prepared and cleaved with PvuI I. 
35 Five discrete fragments, all derived from the DNA insert of 
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pHIX-17, were Isolated. The sites of these fragments were approxi- 
mately 2.3, 1.3, 1.2, 1.1 and 1.0 kllobases. These fragments were 
blunt-end ligated Into the Pvull site of the pAT153/PvuII/8 vector 
•nd transformed into E. coll HB 101. Five clones of recombinant 
DNA which carried the 2.3, 1.3, 1.2, 1.1 and 1.0 kb fragments wore 
obtained and these were designated pATIXPvu-l, 2, 3, 4 and 3 
respectively. Factor IX DNA from pATIXPvu-2 U abbreviated as IV 
and pATIXFvu-5 as III In Figure 6(c). 

(b) Phage DNA from the lambda HIX-1 genomic clone was digested with 
EcoRI. Three different fragments (approximately 5, 2.3, 0.96, kb; 
see Figure 6), all derived from the insert into the phage, were 
isolated and Inserted in pAT153/PvuII/8 vector at the BcoRI sits 
and cloned in E.coll HB 101 to form sub-clones. The three' resulting 
clones for each of these fragments were designated pATIXEco-i, 2 
•nd 4 respectively which are shown in the restriction map of 
Figure 6(d). pATIXEco-1 was further digested with both EcoRI and 
Ball I, and the "overhanging ends" of the restriction sites filled 
In with deoxynucleoslde triphosphates using the Klenow. fragment of 
DNA polymerase I. After isolation of the resulting l.l kb fragment 
by agarose gel electrophoresis and alectroelutlon, it was blunt-end 
ligated using T4 DNA ligase into" the Pvull site of pAT153/PvuII 
and allowed to transform E.coll MC 1061. The resultant sub-clone 
was designated pATIXBE and the factor IX DNA sequence thereof is 
abbreviated as II in Figure 6(c). 
25 (c) Phage DNA from lambda HIX-2 was digested wit* HlndH I and 
- EcoRI giving a 1.8 kb and a 2.6 kb fragment amongst others. These 
fragments were eluted separately, filled in as described in (b) 
•bove, cloned as above into the PyuII site of pAT153/PvuII/8 and 
allowed to transform E.coll MC 1061. The resultant clones were 
JO designated pATIXHE-1, and the factor IX DNA sequence thereof Is 
abbreviated as V in Figure 6(c), and pATIXEco-6 and the factor IX 
DNA sequence thereof is abbreviated as VI in Figure 6(c). 
(d) Phage DNA from lambda HIX-3 was digested with EcoRI and 
Hindll I and the fragments of 2.3 kb and 2.7 kb were sub-cloned 
13 exactly as described in (c) above. The resultant clones were 
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designated pATIXBH-1, abbreviation VII in Figure 6(c), and 
pATIXHE-2, abbreviation VIII In Figure 6(c). 

K. Preparation of e library of cDNA donee from human liver mRNA 

Messenger RNA was extracted from a human liver and a 20-22 
Svedberg unit enriched fraction of mRNA prepared exactly as described 
for bovine mRNA in Section B above, except that a 'translation 
assay • was not used. The first steps in the construction of the 
double-stranded DNA were carried out using the 'Stanford protocol 9 
kindly supplied from Professor P Berg's department at Stanford 
University, USA. This itself is a modification of Wickens, Buell & 
Schlmke (J.Blol.Chem. 253 , 2483-2495, 1978) and some further 
modifications, incorporated in the description given below were 
made in the preaent work. 

For the first strand cDNA synthesis 6 micrograms of poly (A)* 
20-22S human mRNA was incubated with. 5 mlcrolitres of lOx buffer 
(0.5 M Trls-chloride, pH 8.5 at room temperature, 0.4 H KC1, 0.08M 
MgClg and 4 mM dlthlothreitol) , 20 mlcrolitres of a 2.5 mM mixture 
of each of the four deoxynucleoside triphosphates, 0.5 mlcrolitres 
of oligo dT ^2-18)* 1 nicrolitre (containing 0.5 mlcrocurle) of 
[alpha- 32 P] -dATP, 2 mlcrolitres of reverse transcriptase (14 
units per mlcrolitre) and the volume made up to 50 mlcrolitres 
with de ionized water. After incubation for 1 hour at 42°C, the 
solution was boiled for lh minutes and then rapidly cooled on Ice. 
The second strand synthesis was carried out by adding directly to 
the above solution 20 mlcrolitres of 5x second strand buffer (250 mM 
Hepes/KOH pH 6.9, 250 mM KC1, 50 mM MgCl 2 ), 4 mlcrolitres of 
a 2.5 mM mixture of each of the four deoxynucleoside triphos- 
phates, 10 mlcrolitres of E. coll DNA polymerase I (6 units per 
mlcrolitre) and making the volume of the solution up to 100 micro- 
litres with deionlsed water. After incubation for 5 hours at 15°C, 
nuclease digestion was carried out by the addition of 400 micro- 
litres of S^ nuclease buffer (0.03 M sodium acetate pH 4.4, 0.25 M 
NaCl, 1 mM ZnSO^) and 1 mlcrolitre of nuclease (at 500 units 
per mlcrolitre). After incubating for 30 minutes at 37°C, 10 
mlcrolitres of 0.5 M EDTA (pH 8.0) waa added. Double stranded DNA 
was deprotelnlsed by shaking with an equal volume of a phenol: 
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chloroform (1:1) mixture, followed by ether extraction of the 
aqueous phase end precipitation of da DNA by addition of 2 volumes 
of ethanol. Aftor 16 houra at -20°C, da DMA wea recovered by 
centrlfugation. DNA polymerase 1 "fill in" of S l ende vaa carried 
out by a further Incubation of the sample dissolved In 25 micro- 
litres of 50 mM tris-chloride, pH 7.3, 10 mM MgCl 2 , 5 mM dithio- 
threltol and containing 0.02 mM dNTP and 6 unite of DNA polymerase I, 
After Incubating for 10 minutes at room temperature, 5 mlcrolltrea 
of EDTA (0.1 M at pH 7.4) and 3 mlcrolltres of 5% sodium dodecyl 
sulphate (SDS) were added. 

The following part of the protocol dlffera from the Stanford 
protocol*. The sample vaa fractionated on a "mini"-Sephacryi 
S400 column run In a disposable 1 ml pipette In 0.2 M NaCl, 10 mM 
Trls-chlorlde, pH 7.5 and 1 mM EDTA. The first 70% of the "break- 
through" peak of radioactivity vaa pooled (0.4 ml) and deprotelnlaed 
by sheklng vlth an equal volume of n-butanol: chloroform (1:4). To 
the aqueous phase was added 1 microgram of yeast RNA (BDH) as 
carrier followed by 2 volumes of ethanol. After 16 houre at -20°C 
double etranded DNA vaa recovered by centrlfugation for blunt-end 
ligation Into calf intestinal phoaphatase-treated, PvuII-cut 
pAT153/PvuII/8, using T4 DNA llgase (see I and J(ll) above). 
After performing a trial experiment, It was found that when the 
bulk of the sample was Incubated with 200 nanograms of vector DNA 
In a eultable buffer (1 mM ATP, 50 mM Trla-chlorlde, pH 7.4, 10 mM 
MgCl^ and 12 mM dlthlothreltol) and ualng 10 mlcrolltres of T4 DNA 
llgase In a total volume of 0.2 ml, then on subsequent transforma- 
tion of competent E. coli MC 1061 cells a total of 58,000 ampicillln- 
resistant colonlea were obtained. Up to 20% of these were estimated 
to derive from "background" non-recombinanta derived by rellgatlon 
of the vector itself. This 20-22S cDNA library was amplified by 
growing the E. coli for a further 6 houra at 37°C. 1 ml allquota 
of this amplified library were etored at -20°C in L broth contain- 
ing 15% glycerol, before ecreenlng for factor IX cDNA clones. 
**• Iaolatlon and sequence analysis of human factor IX cDNA clonee 

6000 colonlea of the amplified 20-22S human cDNA library were 
plated on each of ten 15 cm agar plates and after growing overnight 
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were blotted Into Whatman 541 filter paper. After preparing 

filtera for hybridisation as described In aaction E(i) above , 

the immobilised colonies were probed with a 1.1 kb molecule of 
32 

[alpha- P] -nick translated human factor IX genomic DNA isolated 
from the pATIXBE aubclone (Section J t above). This linear 1.1 kb 
section of factor IX genomic cDNA vas Isolated from pATIXBE by 
cleavage with the restriction enzymes BamHI and Hlndl ll. followed 
by aaparatlon of the 1.1 kb section from the vector by 1.3X agarose 
gel electrophoresis. After electroelution, nick-translation was 
carried out aa before and the material used in a hybridisation 
reaction for 16 hours at 65°C in 3x SSC, lOx Denhardta solu- 
tion, 0. IX SDS and 30 mlcrograms/ml sonicated denatured E. coll DNA 
and 100 mlcrograms/ml of sonicated denatured herring sperm DNA. ) 
After hydridlsatlon filters were washed at 65°C successively 
in 3x SSC, 0.1X SDS (2 changes, half an hour each) and 2x SSC, 0.1X 
SDS (2 changes, half an hour each). After radioautography , 7 clones 
were selected as positive, but on dilution followed by ra-acreenlng 
by hydridlsatlon aa above, only 5 proved to be positive, Plaamld 
DNA was Isolated from each of these 5 clonea and one, designated 
pATIXcVII, waa selected for sequence analysis aa it appeared to be 
the longest of the 5 clones aa judged by ita electrophoratlc 
mobility on IX agaroae gel electrophoresis. A second shorter 
clone, designated pATIXcVII was also selected for partial sequence 
analysis. 

Sequencing was carried out by the Maxam-Gilbert method and 
a 2778 nucleotide long section of sequence is shown in Figure 9. 
Nucleotides 115-2002 were derived by sequencing clone pATIXcVII. 
(The actual extent of this clone Is greater as it extends in a 5* 
direction to nucleotide 17. The sequence between 17 and 111 la 
Inverted with respect to the remainder of the aequence presumably 
due to a cloning artefact.) Nucleotides 1-130 were derived from 
clone pATIXcVI which extends from nucleotides 1-1348 of Figure 9. 
The sequence from Nos. 2002-2778 was derived by isolating 4 
additional clonea dealgnated pATIX108.1, pATIX108.2, pATIX108.3 
and pATIXDB. The first 3 were derived from a mini-library 
(dealgnated GGB108) of cDNA clones constructed exactly as 
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described In aectlon K above except that aucroae denelty gradient 

centrlfugatlon waa ueed Instead of chromatography on "Sephacryl" 

S-400 to fractlonata tha doubla-atrandad DNA according to alia. A 

fraction of m.w. from 1 kb-5 kb vaa aalactad and an amplified 

library of 10,000 independent clones containing approximately 20X 

background non- recombinant clones vaa obtained. Clone pATIXDB 

derived from another cDNA library (dealgnatad DB1) conatructad as 

dsscribed in aection K except that total poly A+ human liver mRNA 

vaa uaed aa the starting j^terlal and sucrose density gradient 

centrlfugatlon waa used to fractionate the DNA according to alee 

aa in the conatructlon €f the mini-library GGB108. The complexity 

of this, library vaa 93,000 vith an estimated background of non- 

recomlbinants of 50*. Clonea pATIX108.1 and pATIX108.2 vare 

aelected from a qro up^of 30 hybridiaation-poaitive clonea laolated 

by Grunsteln-Kogness screening of the mini library CCB108 ualng 
32 

a P-n*ck translated probe derived from a Sau3AI restriction 
enzyme fragment, Itself derived from nucleotides 1796-2002 of 
clone pATlXcVU. From p AT IX 108.1 the sequence of nucleo- 
tides 2009-2756 vas determined (Figure 9). Following this the 
sequence of a part of pATIX108.2, specifically uucleotidea 1950-2086, 
provided the overlap vith pATIXcVII, The remaining 28 hybridisation 
positive clones were screened by carrying out a triple enzymatic 
digestion with the restriction enzymes EcoR I. BamH I and Hindi I I 
and screening the product of the digest for an EcoRI restriction 
fragment extending in the 3 1 direction from the cut at position 2480. 
By this approach, clone pATIX108.3 vaa aelected and aequenced from 
nucleotldea 2642-2778. This clone vas followed by three A nucleo- 
tides! vhlch sequence vaa confirmed aa a veatlgial marker for the 
poly A tail, by the subsequent isolation of clone pATIXDB by a 
similar method. pATIXDB vaa sequenced from Nos. 2760-2778 and 
ended in 42 A nucleotides, thus marking the 3* end of the mRNA. 

Figure 9 shows that the predicted amino acid aequence codes 
for a protein of 456 amino aclda, but Included in thla are 41 
residues of precursor amino acid aequence preceding the N-terminal 
tyrosine residue (Y) of the definitive length factor IX protein. 
The precursor section of the protein shows s basic amino acid 
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domain (amino acids -1 to -A) as veil ss the more usual hydrophobic 
signal peptide domain (amino acids -21 to -36). 

The definitive factor IX protein consists of 415 amino acids 
with 12 potential gamma-carboxyglutamic acid residues at amino 

05 acids 7, 8, 15, 17 v 20, 21, 26 v 27, 30, 31, 36 and AO. Two 
potential carbohydrate attachment sites occur at amino acid 
residues 1S7 and 167, The activation peptide encompasses 
reslduss 146-180, which are cut out in the activation of Factor XX 
(see Background of Invention) by the peptide cleavage of an R-A 

10 end R-V bond. This leaves a light chain spanning residues 1-143 
and a heavy chain spanning residues 181-415, 

o The exact location of the boundaries between exons (see 
Section H, above) and how they are joined In the mRNA Is marked in 
Figure 9. The exons are marked t, u, v, w, x, y, c. It can be 

15 seen that there Is a rough agreement between the exon domains and 
the protein region**. For example, the exon for the signal peptide 
is distinct from that of the GLA region. Also that of the activa- 
tion peptide ie separated from the serine protease domain. 

The 3' non-coding region of the mRNA is extensive 9 consisting 

20 of 1390 rssidues (including the UAAUGA double terminator 1389-1394 
but excluding the poly A tall). 

The factor IX cDNA Is cleavable by the restriction enzyme 
Hae lll to give a fragment from nuclo tides 133-1 A AO, i.e. a 1307 
nuclotlde long region of DNA entirely encompassing the definitive 

?* factor IX protein sequence. The nucleotide sequence recognised by 
Haelll is GGCC. This fragment should be a suitable starting material 
for the expression of factor IX protein from suitable promoters in 
bacterial, yeast pf mammalian cells. Another suitable fragment 
could be derived using the unique StuI site at residue Al (corres- 

30 ponding to an early part of the hydrophobic signal peptide region) 
and linking It to a suitable promoter. The nucleotide eequence 
recognised by StuI is AGGCCT. 

M. Southern Analysis of normal and patient Christmas disease DNA 
(1) Normal 

35 The etandard (Southern) blotting procedure. Southern, J.Mol. 

Biol. 98, 503-517, 1975) was used. In a typical experiment, 10-20 
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micrograms of human genomic DNA (prepared from uncultured blood 
colls or cultured lymphocytic cells) were digested with one of a 
number of restriction endonucleases and loaded onto a single gel 
slot. Following electrophoresis on 0.8% agarose gel and tranafer 
05 onto nitrocellulose It was hybridised with a probe of 32 P- labelled 
probe II or of 1. A kb EcoRI fragment (see Section H), Labelling 
of the probe waa carried out by nick translation using the method 
of Rigby et al., supra , modified as follows. About 100 nanograms 
of the probe was mixed with 40 mlcrocurles of (alpha 32 P] dATP 
10 (activity about 3,000 Curies/mMole, obtained from Amersham 

International PLC) in 0.05M Tris-HCl, pH 7.5, 0.01M HgClj, 0.001M 
dithiothreitol and dCTP, dGTP, dTTP each at a final concentration 
of 20 micromolar in a volume of 29 mlcrolltres. To this was 
added 1 mlcrolitre of "solution X" made up of a mixture of 6 units 
15 of DNA polymerase I (E. coli) . 0.6 nanograms of pancreatic DNaae I 
(Worthington) , 1 microgram of crystalline BSA in 10 mlcrolltres 
- of 501 v/v glycerol containing 0.05M Tris-HCl, pH 7.5 f 0.01M KgCl 2 
and 0.001M dithiothreitol. The mixture was Incubated for 2 hours 
at 15 C t after vhich high molecular weight DNA was purified by 
SO chromatography on G-100 "Sephadex". Figure 13 shows the major 

bands obtained with DNA from normal individuals probed with either 
probe II (Figure 6) or labelled 1.4 kb EcoRI fragment. With each 
of the 4 ensymes used, EcoRI, Hindi II , Bglll and Bell, a single 
major band of about 4.8, 5.2, 11 and 1.7 kb was obtained. 
5 The fact that these restriction fragments had the same length 

as those observed in the restriction map of clone lambda HIX-1 
confirmed that the conditions of Southern blotting were precise 
enough to detect the factor IX gene in total DNA preparations. 
This provides the basis for analysis of DNA from the blood of 
30 patients with Christmas disease. 

(**) Christmas patients with gene deletions 
The value of the probes of the invention for the assay of 
alterations of genes of some patients suffering from Christmas 
disease has been demonstrated as follows. Two patients with 
35 severe Christmas disease, who also developed antibodies to 

factor IX, were selected for study. The DNA from 50 ml of blood 
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was digested separately vlth EcoRI, Hlndlll, Bftlll and Bell and 
Southern biota prepared for probing with 32 P-nick translated 
probe II (Figure 6). No specific bands were observed with either 
1 1 patient under conditions where a control digest gave the pattern 
05 shown in Figure 13. Similarly no bands were observed in the 
patients when probe I, III or IV (Figure 6) waa substituted for 
probe II. In order to control for possible mischance of some 
experimental artefact giving the observed f negative 9 signal, a 
factor IX gene probe (thia time pATIXcVII - the cDNA probe) was 
10 mixed with an Irrelevant autosomal gene probe which was specific 
for the human Al apollpoproteln (Shoulders and Baralle, Nud. Acids 
Res. iO t 4873-4882. 1982). This experiment showed that patient 1 
had the normal Al apollpoproteln gene, characterised by a 12 kb 
band in an EcoRI digeet, and confirmed that he lacked the 5.3 kb 
15 band observed with pATIXcVII and characteristic of the normal 
factor IX gene. It was concluded that both patients have a 
aequence of at least 18 kb deleted from their factor IX gene. Two 
other patlenta, designated patients 3 and 4, who had also developed ' 
antibodies to factor IX gave bands in the normal or abnormal 
20 positions on Southern blots with some factor IX gene probes of the 
invention, but not with others. This suggested that these patients 
had less extensive deletions of the gene, possibly about 9 kb in 
length. 

These results suggest that diagnosis of haemophiliacs and the 
25 heterozygous (carrier) females would be possible in families and 
this is now under examination. The altered pattern seen in the 
patient's DNA, whether absence of a band or the presence of a band 
in an abnormal position, serves as a "disease marker", which can 
be used to assess for its presence or absence in a suspected 
30 carrier. This same test can be applied to antenatal diagnosis, if 
DNA from foetal cells are available from an amniocentesis. "Genetic 
diagnosis" should considerably improve existing methods of antenatal 
diagnosis based on the assay of foetal factor IX protein levels, 
with the added advantage that the test can be carried out earlier 
35 in pregnancy. Genetic methods using natural polymorphisms within 
the factor IX gene as allelic markers should alao make 100X carrier 
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deletion a reality, thereby Improving the existing somewhat 
unsatisfactory aethoda where probability valuaa ara of farad to 
patients. 

Deposits have been made at the National Collection of 
Industrial Bacteria, Torry Research Station, P 0 Box 31, 
135 Abbey Road t Aberdeen AB9 8DC Scotland, aa indicated in the 
Claims hereinafter. Also E. coli K-12 strain 803 mentioned above, 
which la a suitable hoat for the lambda HlX-lb phage, haa bean 
depoaltad at the NCIB on 26 July 1982 under Acceaalon No. 11732. 
All deposits were made under the provisions of the Budapest Treaty 
on the International Recognition of the Deposit of Micro-organisms 
for the Purpoaea of Patent Procedure. 
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CUIMS 

1. Recombinant DNA which comprises a cloning vehicle DNA 
sequence and « DNA sequence foreign to the cloning vehicle, the 
foreign sequence comprising substantially the following 129- 
nucleotlde sequence (read in rovs of 30 across the pags)i- 



05 



ATCTAACATG 
CCGAGCAGTT 
ACAAGGTGGT 
AXCCACTTCC 
AACCAGCAG 



TAACATTAAG 
TTGTAAAAAT 
TTGCTCCTGT 
AGAAAACCAG 



AATGCCAGAT 
AGTGCTGATA 
ACTGAGGGAT 
AAGTCCTGTG 



10 



2. Recombinant DNA which comprises a cloning vehicle DNA 
sequence and a DNA sequence foreign to the cloning vehicle, the 
foreign sequence comprising substantially'the following 203- 
nucleotlde sequence (read in rows of 30 across the page)i- 



15 



20 



TGCCATTTCC 
CACAAACTTC 
CTGTTTTTCC 
CTACTGAAGC 
TCACTCAAAG 
TCACTCGGGT 
AACCAGGTCA 



ATGTGGAAGA 
TAAGCTCACC 
TGATGTGGAC 
TGAAACCATT 
CACCCAATCA 
TGTTGGTGGA 
ATTCCCTTGG 



GTTTCTCTTI 
CGTGCTGAGG 
TATGTAAATT 
TTGGATAACA 
TTTAATGACT 
GAAGATGCCA 
GAG 



3. Recombinant DNA which comprises a cloning vehicle DNA 
sequence and a sequence foreign to the cloning vehicle, the 
foreign sequence being substantially the same as a sequence 
occurring in the human factor IX genome. 
25 A. Recombinant DNA according to Claim 3 wherein the human 
factor IX sequence has a length of at least 50 nucleotides. 

5. Recombinant DNA according to Claim 3 wherein the length of 
the human factor IX sequence is from 75 to 27,000 nucleotides. 

6, Recombinant DNA which comprises a cloning vehicle sequence 
30 and a DNA sequence foreign to the cloning vehicle, wherein the 
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foreign sequence Includes substantially the whole of en exon 
sequence of the human fector IX genome. 

7. Recombinant DNA which comprises s cloning vehicle eequence 
end s DNA sequence foreign to the cloning vehicle, wherein the 
foreign sequence comprises e DNA sequence vhlch Is complementary 
to the human factor IX mRNA. 

8. Recombinant DNA according to Claim 3, 4 or 5, wherein the 
cloning vehicle Is a modified pAT153 plasmld prepared by lighting 
s BamHI and Hindi II double digest of pAT153 to a pair of comple- 
mentary double sticky-ended oligonucleotides having a DNA sequence 
providing a BamH I restriction residue at one end, a Hindi I I restric- 
tion residue at the other end and a Pviill restriction site in 
between. 

9. Recombinant DNA according to Claim 8 wherein the pair of 
complementary oligonucleotides are of formulas- 

.3* GATCCAGCTGA 3» 



3' CTCGACTTCGA 3 1 

10. Recombinant DNA which compriees a cloning vehicle sequence 
and a DNA sequence foreign thereto which hybridises to a 247 
base-pair sequence of bovine factor IX DNA complementary to 
messenger RNA and Indicated In Figure 5 by the arrows at each end 
thereof. 

11. A host transformed with at least one molecule per cell of 
recombinant DNA claimed In any preceding claim. 

12. A host accbrding to Claim 11 in the form of E. coll . 

13. A host according to Claim 11 In the form of mammalian tissue 
cells. 

14. A process of preparing a host transformed with recombinant 
DNA as claimed In any one of Claims 1 to 7, which process com- 
prises:-^ 

(1) synthesizing an oligodeoxynucleotlde probe having a nucleotide 
sequence comprising that occurring in bovine factor IX messenger 
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RNA coding for amino acids 70*75 or 348-352 of bovine factor IX 
and labelling the ollgodeoxynucleotlde to form a probe; 

(2) preparing complementary DNA to a mixture of bovine RNA; 

(3) inserting the complementary DNA in a cloning vehicle to form 
05 a mixture of recombinant bovine cDNAa; 

(A) transforming a host with said mixture of recombinant bovine 
cDNAs to form a library of clones and multiplying said clones; 

(5) probing the clones with the synthetic ollgodeoxynucleotlde 

i probe obtained in step 1 and isolating a resultant recombinant 

10 bovine factor IX cDNA-containlng clone; 

(6) digesting the recombinant bovine factor IX cDNA from said 
clone vlth one or more enzymes to produce a bovine factor IX 
cDNA molecule containing a shorter sequence of bovine factor IX 
DNA; and 

15 (7) probing a library of recombinant human genomic DNA in a 

. transformed host vlth the shorter sequence bovine factor IX . 
cDNA molecule, to hybridise the human genomic DNA to the said 
recombinant bovine factor IX DNA and isolating the resultant 
recombinant DNA-transformed host. 
20 15. A process of preparing a host transformed vlth recombinant 

DNA as claimed in Claim 1, 2 or 7 V which process comprises probing 
a library of clones containing recombinant DNA complementary to 
human mRNA with a probe comprising a labelled DNA comprising a 
sequence complementary to part or all of an exon region of the 
25 human factor IX genome. 

16. A DNA molecule comprising an at least 15 nucleotide long 
sequence of part or all of substantially the 129-nucleotlde sequence 
set forth in Claim 1. 

17. A DNA molecule comprising an at least 15 nucleotide long 

30 sequence of part or all of substantially the 203-nucleotlde sequence 
set forth in Claim 2. 

18. A DNA molecule comprising an at least 15 nucleotide long 
sequence of part only of the DNA sequence of the human factor IX 
genome. 
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19. A DNA molecule comprising a sequence of length at least 13 
nucleotides substantially the same as a sequence complementary to 
part or all of that occurring In human factor IX mRNA. 

20. A DNA molecule according to any one of Clalma 16 to 19 of 
05 length at least 50 nucleotides. 

21. An artificial DNA molecule comprising a sequence substan- 
tially the same as a sequence of length at least 15 nucleotides 

. occurring In the human factor IX genome. 

22. An artificial DNA molecule according to Claim 21 comprising 
10 substantially only-exon sequences. 

23. A labelled diagnostic probe comprising a DNA molecule having a 
single-stranded or double-stranded probe sequence of from 15 to 10,000 
nucleotides long of DNA sequence defined in Claim 16, 17, 18 or 19 

or Its complementary sequence. 
15 24. A probe according to Claim 23 having a probe sequence from 20 
to 5,000 nucleotides long. 

25. Recombinant DNA (being the phage present In the clone herein- 
before designated lambda HIX-l) deposited as a phage at the National 
Collection of Industrial Bacteria, Aberdeen, under Accession 

20 No. 11749 on 30 July 1982. 

26. Recombinant DNA according to Claim 3, wherein the cloning 
vehicle is the modified pAT153 plasmld (hereinbefore designated 
pAT153/Pvu II/8) present in the E. coll strain deposited at the 
National Collection of Industrial Bacteria, Aberdeen, under 

25 Accession No. 11747 on 19 July 1982. 

27. Recombinant DNA according to Claim 10 wherein the bovine 
factor IX DNA sequence is contained in the recombinant DfoA 
transformed Into E. coll to form the E. coll clone hereinbefore 
designated BIX-1 and deposited at the National Collection of 

30 Industrial Bacteria, Aberdeen under Accession No. 11748 on 
19 July 1982. 
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CLAIMS FOR AUSTRIA 
1. A process of preparing recombinant DNA In which a cloning 
vehicle DNA sequence la joined to a sequence foreign to the cloning 
vehicle, characterised In that the foreign sequence Is substantially ; 
the same as a sequence occurring in the human factor IX genome. 
05 2. A process according to Claim 1 characterised In that the 

cloning vehicle sequence Is joined to a foreign sequence which 1 
Includes substantially the whole of at least one axon sequence of } 
the human factor IX genome. i 

3. A process according to Claim 1 or 2 characterised In that the j 
10 cloning vehicle sequence is joined to a foreign sequence eomple- 1 

mentary to a sequence occurring in the human factor IX mMA. 1 

4. A process according to Claim 1 or 2 characterised in that the J 
human factor IX sequence has a length of at least 50 nucleotides. j 

5. A process according to Claim A characterised In that the 

5 human factor IX sequence has a length from 75 to 27,000 nucleotides. ' 

6. A process according to Claim 1, characterised in that the 
cloning vehicle DNA is joined to a foreign sequence which Includes • 
substantially the following 129-nucleotide sequence (read in rows ' 
of 30 across the page):- j 

0 ATCTAACATG TAACATTAAG AATGCCACAT i 

CCGACCACTT TTGTAAAAAT ACTGCTGATA j 

ACAAGGTGCT TTGCTCCTGT ACTGACGGAT 

ATCGACTTGC AGAAAACCAG AAGTCCTGTG 

AACCAGCAG 

5 7. A proceaa according to Claim 1, characterised in that tha 
cloning vehicle DNA sequence la joined to a foralgn sequence which 
Includes substantially the following 203-nudeotlde aequence (read 
la rowa of 30 across tha page)t- 

TCCCATTTCC ATGTGCAAGA GTTTCTGTTT 

D CACAAACTTC TAAGCTCACC CCTGCTGAGG 

CTGTTTTTCC TCATGTGCAC TATCTAAATT 
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C1ACTCAACC 



TGAAACCATT 



TTCCATAACA 



TCACTCAAAG 
TCACTCCGCT 



CACCCAATCA 
TCTTCCTCGA 



TTTAATGACT 



GAAGATGCCA 



AACCAGGTCA 



ATTCCCTTGG 



CAG 



8. A process according to Claim 1 characterised in that the 

05 cloning vehicle sequence is joined to a foreign DNA sequence which 
is capable of hybridising to a 247 base-pair sequence of bovine 
factor IX DNA complementary to messenger RNA and indicated in 
Figure 3 of the drawings by the arrows at each end thereof. 

9. A process according to Claim 8 characterlssd in that the 
10 recombinant DNA prepared is that deposited as a phage at the 

National Collection of Industrial Bacteria, Aberdeen, under 
Accession No. 11749 on 30 July 1982, which is hereinbefore 
designated as the phage present in the clone lambda HIX-1. 

10. A process according to Claim 8 characterised in that the 

5 bovine factor IX DNA sequence is contained in the recombinant DNA 
transformed into E. coll to form the E. coll clone hereinbefore 
designated BIX-1 and deposited at the National Collection of 
Industrial Bacteria, Aberdeen under Accession No. 11748 on 
19 July 1982. 

0 11. A process according to any preceding claim characterised in 
that a sequence of a cloning vehicle which is a modified pAT153 
plasmld prepared by ligating a BamHI And Hindlll double digest of 
PAT153 to a pair of complementary double sticky-ended oligonucleo- 
tides having a DNA sequence providing a BamHI restriction residue 

5 at one end, a Hindi! I restriction residue at the other end and a 

PvuII restriction site in between, is Joined to the foreign sequence. 
12. A process according to Claim 11 characterised in that the 
pair of complementary oligonucleotides are of formula t- 



GATCCAGCTGA 



30 



3 f GTCCACTTCGA 5 9 
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13. A process according to Claim 11, characterised In that a 
cloning vehicle sequence vhlch Is derived from the modified pAT153 
plasmld (hereinbefore designated pAT153/Pvu II/8) present In the 
E. coll strain deposited at the National Collection of Industrial 

05 Bacteria, Aberdeen, under Accession No. 11747 on 19 July 1982, Is 1 
joined to the foreign sequence. 

14. A process of preparing a host transformed vlth recombinant 
DNA characterised In that a host Is transformed vlth recombinant 
DNA according to any one of Claims 1 to 13. 

10 15. A process according to Claim 14 characterised In that the 
host Is In the form of E. coll . 

16. A process according to Claim 14 characterised In that the 
hoat la In the form of mammalian tissue cells. 

17. A process according to Claim 14 characterised In that a 

15 transformation - competent host Is Incubated with recombinant DNA 
claimed In any one of Claims 1 to 13. 

18. A process according to Claim 14 characterised in that a 
library of clones containing recombinant DNA complementary to 
human mRNA Is probed with a probe comprising a labelled DNA 

20 comprising a human factor IX DNA sequence defined In any one of 
Claims 1 to 10. 

19. A process of preparing a diagnostic probe characterised in 
that a label la attached to a DNA. molecule having a single-stranded 
or double-stranded probe sequence of from 15 to 10,000 nucleotides 

25 long of part of the DNA sequence defined In Claim 1 or its comple- 
mentary sequence. 

20. A process according to Claim 19 characterised in that the 
probe sequence is from 20 to 5,000 nucleotides long. 

21. A process according to Claim 19 characterised in that the 

30 probe sequence includes an at least 15 nucleotide long sequence of 
part or all of substantially the 129-nucleotide sequence set forth_ 
in Claim 6. 

22. A proceaa according to Claim 19 chacterised in that the probe . 
sequence includes an at least 15 nucleotide long sequence of part 

35 or all of substantially the 203-nucleotlde sequence set forth in 
Claim 7. 
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23. A process according to Claim 19 or 20 characterised in that 
the probe sequence includes substantially only axon sequence(s) of 
the human factor IX genome. 

24. A process according to Claim 19 characterised in that the 
probe sequence is substantially the same as a sequence complementary 
to part or all of that occurring in human factor IX mRNA. 
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1st amine acid 70 75 

sequence : Olu-Cys-Trp-Cys-Oln-Ala 

: 5 ' gaJ ugJ ugg ugJ caJ GCN 3' 

Deoxyoligonucleotidea 3 ' ClJ AcJ ACC AcJ GTT CO (olige N2A) 



3* CtJ AcJ ACC AcJ GTC CO (oligo N2B) 



2nd amino acid 348 352 

sequence : Hie-Met-Phe-Cys-Ala 

mRNA : 5' ca U Aim im U n/? u 



Deoxyol igonucleot i des 

synthesized : GT* TAC AA* ac a 



' CA C AUG Ul£ UGj GCN 3 « 

CTj TAC AAj ACJ CO (oligo Nl) 



Kg./ 
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60 

SNPCLNOGMCKDDI.NSY 
TGAATCCAATCCATGTTTAAATGGCGGCATGTGCAAGGATOACATTAATTCCTAT 
10 20 30 40 50 

70 80 90 

ECW CQACFEGTNC ELDATCSXK 
GAATGTTGGTGTCAAGCT^aATTTQAAQCAAfinAArTr.T^AATTAfiATftrA^r^T^^^^j^yy^ 

€0 70 80 90 100 1X0 120 

100 

NGRCKQFCKRDTDNKVVC 
GAATGGCAGATGCAAGCAGTTTTGTAAAAGGGACACAGATAACAAGGTGGTTTGT 
130 140 150 160 170 

no 120 130 

sctdoyrlaedoksc e p a v p p p 
tcctgtactgacggataccgacttgcagaagaccaaaagtcctgcgaaccagcagt4ccatttcc 

*»U 190 200 210 220 230 ' 240 

140 iso 
c g r v s v s hTv RPRFHGLCSC»E' 
CTGTGGACGAGTCTCTGTCTCACATpTGAGGCCCCGCTTTCACGGTCTGTGTTCGTGCTGAGAA 
2* 260 L 270 280 290 < 300 
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FIG. 8(a) 
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FIG. 8(c) 
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FIG. 8(f) 
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FIG. 8(g) 
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FIG. 8(h) 
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FIG. 8(k) 
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FIG. 8(1) 
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FIG. 6(m) 
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FIG. 8(n) 
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